Human recombinant growth and differentiaton factor-5 (rhgdf-5)

ABSTRACT

Expression vector systems are provided for increased production of a recombinant GDF-5 (rhGDF-5) protein. Also provided are transformed host cells that were engineered to produce and express high levels of rhGDF-5 protein. Methods for production and high expression of rhGDF-5 protein are disclosed herein. The methods of enhancing production and protein expression of rhGDF-5 protein as disclosed are cost-effective, time-saving and are of manufacturing quality.

BACKGROUND

The present disclosure relates generally to a recombinant human growth and differentiation factor-5 (rhGDF-5) protein and, specifically to expression vector systems for increased production of rhGDF-5, host cells or cell lines for producing rhGDF-5, methods of producing rhGDF-5 using the host cells or cell lines and methods of enhancing production and protein expression of rhGDF-5 protein that are cost-effective, time-saving and manufacturing quality.

Biologics, a therapeutic product, can be made by genetically engineering living cells and requires a high level of precision and care and various factors for its manufacturing process to yield a consistent biologic product each time. For example, a biologics that is produced by recombinant host cells, either in prokaryotes or eukaryotes, can be influenced by (i) individual cell characteristics and (ii) the environment and nutrients provided during the manufacturing process. An example of a biologics is Growth and Differentiation Factor-5 (GDF-5).

GDF-5 belongs to the Bone Morphogenetic Protein (BMP) family, which itself is a subclass of the transforming growth factor-β superfamily of proteins. There are several variants and mutants of GDF-5 (GDF family members), some of which include the first isolated mouse GDF-5 (U.S. Pat. No. 5,801,014); MP52, a human form of GDF-5 (hGDF-5; (WO 95/04819)) or LAP-4 (Triantfilou et al., Nature Immunology 2, 338-345, 2001); cartilage-derived morphogenetic protein (CDMP)-1, an allelic protein variant of hGDF-5 (Chang, S. C. et al., J. Biol. Chem. 269(45):28227-34 (1994); WO 96/14335); rhGDF-5, a recombinant human form prepared from bacteria (EP 0955313); rhGDF-5-Ala83, a monomeric variant of rhGDF-5; BMP-14, a collective term for hGDF-5/CDMP-1 like proteins; SYNS2; Radotermin, the international non-proprietary name designated by the World Health Organization; HMW MP52's, high molecular weight protein variants of MP52; C465A, a monomeric version wherein the cysteine residue responsible for the intermolecular cross-link is substituted with alanine; also other active monomers and single amino acid substitution mutants including N445T, L441 P, R438L, and R438K.

The GDF-5 family members share common structural features including a carboxy terminal active domain and is characterized by a polybasic proteolytic processing site, which can be cleaved to release a mature protein containing seven conserved cysteine residues. The conserved pattern of cysteine residues creates 3 intra-molecular disulfide bonds and one inter-molecular disulfide bond. The active form can be either a disulfide-bonded homodimer of a single family member or a heterodimer of two different members (Massague et al., Ann. Rev. Cell Biol. 6:957 (1990); Sampath et al., J. Biol. Chem. 265:13198 (1990); Celeste et al., Proc. Natl. Acad. Sci. USA 87:9843-7 (1990); U.S. Pat. No. 5,011,691 and U.S. Pat. No. 5,266,683). The proper folding of the GDF-5 protein and formation of these disulfide bonds are essential to biological functioning, and misfolding leads to inactive aggregates and cleaved fragments.

GDF-5 is expressed in the developing central nervous system (O'Keeffe, G. et al., J. Neurocytol. 33(5):479-88 (2004) and has a role in skeletal and joint development (Buxton, P. et al., J. Bone Joint Surg. Am. 83-A, S1(Pt. 1):523-30 (2001); Francis-West, P. et al., Development 126(6):1305-15 (1999); Francis-West, P. et al., Cell Tissue Res. 296(1):111-9 (1999)). The GDF-5 family members are regulators of cell growth and differentiation in both embryonic and adult tissues. For example, GDF-5 may induce angiogenesis in the bone formation process (Yamashita, H. et al., Exp. Cell Res. 235(1):218-226 (1997); CDMP-1 stimulates activity of articular chondrocytes thereby contributing to the integrity of the joint surface (Erlacher, L. et al., Arthritis Rheum. 41(2):263-73 (1998)). Changes in expression patterns of GDF-5 and its receptors are associated with human articular chondrocyte dedifferentiation (Schlegel, W. et al., J. Cell Mol. Med. 13(9B):3398-404 (2009)). As a growth factor, GDF-5 (CDMP) may stimulate proteoglycan production in the human degenerate intervertebral disc (Le Maitre, C. L. et al., Arthritis Res. Ther. 11(5):R137 (2009)). It may increase the survival of neurons that respond to a dopamine neurotransmitter and can be a potential therapeutic molecule associated with Parkinson's disease. (Sullivan and O'Keeffe, J. Anat. 207(3):219-26 (2005)). When rhGDF-5 was delivered on beta-tricalcium phosphate, an effective encouragement of periodontal tissue regeneration in non-human primates was observed. In tissues critical for periodontal repair (e.g. alveolar bone, cementum and periodontal ligament), rhGDF-5 treatment on these tissues showed evidence of regeneration and the response was found to be dose-dependent (Emerton, K. B. et al., J. Dental Res. 90(12):1416-21 (2011). Based on this finding and other similar reports, a biologics such as GDF-5 may offer new approaches or options to regenerate bone during dental implant placement and may save a tooth in patients who are at risk for tooth loss due to periodontal disease.

GDF-5 gene mutations can be associated with the following health conditions, e.g., acromesomelic chrondrodysplasia Grebe type (AMDG; (Thomas, J. T. et al., Nat. Genet. 1:58-64 (1997);), Hunter-Thompson type (AMDH; (Thomas, J. T. et al., Nat. Genet. 3:315-7 (1996)); brachydactyly type C (BDC; Francis-West, P. H. et al., Development, 126(6):1305-15 (1999), Everman, D. B. et al., Am. J. Med. Genet., 112(3):291-6 (2002), Schwabe, G. C. et al., Am. J. Med. Genet. A. 124A(4):356-63 (2004)); DuPan syndrome (DPS), which is also known as fibular hypoplasia and complex brachydactyly (Faiyaz-Ul-Hague, M. et al., Clin. Genet. 61(6):454-8 (2002)); Mohr-Wriedt brachydactyly type A2 (Kjaer, K. W. et al., J. Med. Genet. 43(3):225-31 (2006)); multiple synostoses syndrome type 2 (SYNS2; Dawson, K. et al., Am. J. Human Genet. 78(4):708-12 (2006), Schwaerzer, G. K. et al., J. Bone Miner. Res. 27(2):429-42 (2012)); semidominant brachydactyly A1 (BA1; Byrnes, A. M. et al., Hum. Mutat. 31(10): 1155-62 (2010)); symphalangism (SYM1; Yang, W. et al., J. Hum. Genet. 53(4):368-74 (2008)) or brachydactyly type A2 (BDA2; Seemann, P. et al., J. Clin. Invest. 115(9):2373-81 (2005), Ploger, F. et al., Hum. Mol. Genet. 53(4):368-74 (2008)); susceptibility to osteoarthritis type 5 (OS5; Masuya, H. et al., Hum. Molec. Genet. 16:2366-75 (2007), Miyamoto, Y. et al., Nature Genet. 39:529-53 (2007)); knee osteoarthritis in Thai ethnic population (Tawonsawatruk, T. et al., J. Orthop. Surg. Res. 6:47 (2011)). GDF5 gene variants have been associated with hand, knee osteoarthritis and fracture risk in elderly women, which replicates the previous association between GDF5 variation and height. (Vaes, R. B. et al., Ann Rheum. Dis. 68(11):1754-60 (2009)). All of these associations confirmed that the GDF-5 gene product may play a role in skeletal development.

Expression of GDF-5-related proteins using recombinant DNA techniques has been done and their purification and production for industrial scale have also been explored. See for example, Hötten, U.S. Pat. No. 6,764,994; Makishima, U.S. Pat. No. 7,235,527; Ehringer, U.S. Pat. No. 8,187,837). Both Hötten and Makishima described (1) a complete DNA nucleotide sequence that codes for the TGF-β protein MP-52 and the complete amino acid sequence of MP52; and (2) a composition containing a pharmaceutically active amount of the MP-52 for wound healing and tissue regeneration, treating cartilage and bone diseases and dental implants. According to Makashina, isolation of pure MP-52 at least with the mature region from the mixture was difficult (Makashina, column 1, lines 59-61). To overcome this obstacle, Makashina constructed a DNA plasmid wherein a codon encoding methionine was linked to the DNA sequence that encodes for a 119-amino acid residue protein (MP-52) and wherein the N-terminal alanine of the mature MP-52 protein (120-amino acid residue) was eliminated. Ehringer, on the other hand, described an advanced method for the efficient prokaryotic production and purification of GDF-5 related proteins that resulted in better protein yield, high product purity and improved industrial applicability. Problems encountered during the purification and refolding of the GDF-5-related proteins in large scale were disclosed and addressed.

The use of prokaryotic expression vectors such as bacterial plasmids for expressing preventive or therapeutic peptides (biologics) is very critical and beneficial not only for biochemical research and biotechnology but even more so for medical therapy. Such use is the basis of many biologics manufacturing processes. High-cell density (HCD) fermentation methods that employ these processes offer many advantages over traditional methods in that the final product concentrations are higher, downtime and water usage are reduced, and overall productivity is improved resulting in lower set-up and operating costs.

The recombinant protein and plasmid DNA production typically involves: (1) bacterial propagation and fermentation production, wherein a plasmid encoding a gene of interest is transformed into a bacterial cell, typically Escherichia coli (E. coli), propagated to make master and working cell banks, and further grown in a bioreactor (e.g., fermentor) to make production cells that contain high yields of the plasmid; and (2) purification and formulation stability, wherein the production cells are lysed and plasmid DNA carrying the gene of interest is purified by a plurality of purification methods and formulated for delivery. Expression is particularly higher if the gene of interest is codon optimized to match that of the target organism, which leads to improved gene function and increased protein expression, which ultimately leads to cost-effectiveness of mass producing the recombinant protein.

Plasmid fermentation processes for plasmid production should be optimized to retain a high percentage of supercoiled plasmid. Other plasmid forms are difficult to eliminate during purification and their presence are undesirable. Fermentation media and processes needs to be optimized for plasmid yield, plasmid quality and compatibility of the resultant cells for harvest and lysis. There are about three fermentation processes that can be utilized to initiate production, namely: batch, batch-fed or continuous fermentation processes. For a large scale production, a batch fermentation that generally yields about 10-20 mg/L of plasmid DNA has its limitations such as uncontrolled growth rates and waste product accumulation (e.g., production of reduced carbon metabolites such as acetates, lactates and formates) that ultimately would lead to inhibition of bacterial growth. To prevent these issues from occurring and to increase plasmid yield, fed-batch or continuous high cell density fermentation can be a better option. Continuous fermentation processes are more conducive to the production of large amounts of a single product but sterility remains an issue. Fed-batch fermentation begins with a short batch fermentation and is proceed by the addition of media at a defined rate. It is more flexible and consistent than the batch method and allows for simple optimization of fermentation profiles for each plasmid DNA product. When employing a defined growth rate strategy as a form of feed strategy, a feed media is added at rates that are determined based on an pre-established growth profile, wherein the feed is triggered by an initial DO2 spike (caused by the exhaustion of initial bolus of glucose in the media). Peterson, M. and Brune, M., in BioPharm International Supplements entitled: “Maximizing Yields of Plasmid DNA Processes,” Jun. 2, 2008.

Chemically-defined (minimal) media contain known quantities of ingredients added to purified water. The absence of animal-derived components in chemically-defined media may be more desirable from a regulatory standpoint due to concerns over BSE/TSE (spongiform encephalopathy/transmissible spongiform encephalopathy). They have reproducibility (their components have known chemical structures that can allow consistent performance of cells in the medium), greater simplicity of both downstream processing and the analysis of product and greater control of feeding strategy when carbon sources are known.

Complex media, on the other hand, are digests of food and agriculture by-products (i.e. protein hydrolysate and yeast extract). They can provide a majority of needed nutrients to host cell (e.g., Escherichia coli) fermentation. They may produce high yields at lower costs (thus, more cost-effective) and less control over individual components and possibly vary from lot-to-lot.

Semi-defined media contain small concentrations of complex ingredients usually from about 0.05 to about 0.5% added to a chemically defined media. Semi-defined media can maximize performance while minimizing downstream processing issues. Small amount of complex material may provide enough nutrients to enhance growth of microorganisms without interfering with recovery or analysis.

Given the role of GDF-5 in cell growth and differentiation, in particular, skeletal and joint development and bone regeneration, there is a critical need for a therapeutic rhGDF-5 biologics that can be manufactured in large scale processes. There is an urgent need for improving the manufacturing process of rhGDF-5 that can be cost-effective, time-saving and manufacturing quality.

SUMMARY

The present disclosure includes methods and compositions for the production of rhGDF-5 using the T5 or Trc promoter in the production of rhGDF-5 for therapeutic applications. The rh-GDF-5 can be easily produced in large scale quantities in cost-effective, and time-saving manner.

In various embodiments, there is an expression vector comprises a T5 or a Trc promoter operably linked to a polynucleotide sequence that encodes a GDF-5 protein.

In various embodiments, a host cell line engineered to express rhGDF-5 protein by the expression of a vector is also provided.

In various embodiments, there is a method for producing a rhGDF-5 (rhGDF-5) polypeptide comprises: providing a prokaryotic host cell comprising an expression vector which comprises a polynucleotide sequence encoding a polypeptide sequence under the control of a T5 or Trc promoter; cultivating the prokaryotic host cell under suitable conditions so as to induce or promote the expression of the polynucleotide sequence of SEQ ID NO: 1 in the expression vector; and recovering the rhGDF-5 polypeptide.

Additional features and advantages of various embodiments will be set forth in part in the description that follows, and in part will be apparent from the description, or may be learned by practice of various embodiments. The objectives and other advantages of various embodiments will be realized and attained by means of the elements and combinations particularly pointed out in the description and appended claims.

BRIEF DESCRIPTION OF THE FIGURES

In part, other aspects, features, benefits and advantages of the embodiments will be apparent with regard to the following description, appended claims and accompanying drawings where:

FIGS. 1A and 1B are plasmid maps of pGDF5-T5 and pGDF5-Trc expression vectors, respectively.

FIG. 1C shows a protein alignment of the theoretical amino acid sequence encoded by pGDF5-Trc and that of a commercially-known rhGDF-5 protein, as set forth in the Sequence Listing as SEQ ID NOs: 4 and 5, respectively.

FIGS. 2A and 2B are agarose gels showing NdeI-linearized plasmid DNAs prepared from pGDF5-T5- and pGDF5-Trc-transformed DH10β and STBL2 clones.

FIGS. 3A and 3B are Western blots showing the absence and presence of GDF-5 protein in the supernatant and pellet fractions of selected pGDF5-Trc-transformed clones, respectively.

FIG. 3C shows the lanes and samples that correspond to the Western blots of FIGS. 3A and 3B.

FIG. 4A is a Western blot showing over-expression of GDF-5 in pGDF5-Trc-transformed Clones 1 and 4.

FIG. 4B shows the growth profile and rate of pGDF5-Trc-transformed HMS174 Clones 1 and 4.

FIGS. 5A and 5B shows the growth profiles of pGDF5-Trc-transformed HMS174 Clones 1 and 4 when grown using the ultra yield shake flask and 5 L Applikon Fermentor, respectively.

FIG. 6A is a Coomassie brilliant blue-stained gel showing GDF-5 protein production from supernatant and pellet fractions of pGDF5-Trc-transformed HMS174 Clones 1 and 4 that were grown by either using the ultra yield shake flask method or the 5 L Applikon Fermentor method.

FIG. 6B is a Western blot showing GDF-5 over-expression from supernatant and pellet fractions of pGDF5-T5- and -Trc-transformed HMS174 Clones 1 and 4 that were grown by either using the ultra yield shake flask method or the 5 L Applikon Fermentor method.

FIG. 7 is an exemplary formulation of a high cell density media according to the embodiment of the present disclosure.

FIGS. 8A, 8B and 8C show the enhancing effects of sodium molybdate, magnesium sulfate, heptahydrate and sodium chloride on rhGDF-5 expression when these three components were added to Media 1, which was optimized based on its improved response to rhGDF-5 expression in the cultured pGDF5-Trc-transformed host cells. Data obtained were evaluated using statistical software.

FIG. 9A and 9B show an increase of rhGDF-5 expression by the addition of yeast extract and magnesium sulfate into Media 2 (optimized based on its improved response to rhGDF-5 expression) but the addition of sodium molybdate decreased rhGDF-5 expression in pGDF5-Trc-transformed host cells. Data obtained were evaluated using statistical software.

FIGS. 9C and 9D show biomass optimization by the addition of yeast extract while negatively affected by the addition of sodium chloride and MOPS when these three components were added to Media 3, which was optimized based on its improved response to biomass yield in the growth of pGDF5-Trc-transformed host cells. Data obtained were evaluated using statistical software.

FIGS. 9E and 9F show the results for the optimization of the culture media.

FIG. 10 is a Coomassie brilliant blue-stained gel showing the level of GDF-5 protein production of pGDF5Trc-transformed HMS174 cells when grown under different types high cell density media that were designed with respect to their optimized response to either rhGDF-5 expression (protein production) and biomass yield (growth rate): Media 1 (defined media improved rhGDF-5 expression); Media 2 (semi-defined media improved rhGDF-5 expression); and Media 3 (semi-defined media improved biomass yield).

FIGS. 11A-C are Coomassie brilliant blue-stained gels showing the level of GDF-5 protein production and expression of pGDF5Trc-transformed HMS174 cells when grown under (a) pH 6.5 (condition A); (b) pH 7.1 (condition B); and (c) pH 6.8 at low oxygen (condition C), respectively.

FIG. 11D shows the effect of pH and oxygen on protein expression and growth rate of pGDF5Trc-transformed HMS174 cells.

FIG. 11E shows the effect of pH and oxygen on the ratio of two major expressed bands, 40 kDa:14 kDa of pGDF5Trc-transformed HMS174 cells.

It is to be understood that the figures are not drawn to scale. Further, the relation between objects in a figure may not be to scale, and may in fact have a reverse relationship as to size. The figures are intended to bring understanding and clarity to the structure of each object shown, and thus, some features may be exaggerated in order to illustrate a specific feature of a structure.

DETAILED DESCRIPTION

For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing quantities of ingredients, percentages or proportions of materials, reaction conditions, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained by the present application. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.

Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the present disclosure are approximations, the numerical are as precise as possible. Any numerical value, however, inherently contains certain errors necessarily resulting from the standard deviation found in their respective testing measurements. Moreover, all ranges disclosed herein are to be understood to encompass any and all subranges subsumed therein. For example, a range of “1 to 10” includes any and all subranges between (and including) the minimum value of 1 and the maximum value of 10, that is, any and all subranges having a minimum value of equal to or greater than 1 and a maximum value of equal to or less than 10, e.g., 5.5 to 10.

Additionally, unless defined otherwise or apparent from context, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this disclosure belongs.

Unless explicitly stated or apparent from context, the following terms are phrases have the definitions provided below:

It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless expressly and unequivocally limited to one referent.

For the purposes of this application the term “GDF-5” is meant to include all variants and mutants of the GDF-5 protein, and rhGDF-5 is an exemplary member having 125 amino acids as set forth in the Sequence Listing as SEQ ID NO:4.

The term “cysteine-knot domain” refers to a conserved cysteine-rich amino acid region that is present in the mature parts of TGF-β superfamily proteins, such as i.e. human GDF-5 and forms a three-dimensional protein structure known as cysteine-knot. It has been shown that the cysteine-knot domain alone is sufficient for the biological function of the protein (Schreuder et al., Biochem. Biophys. Res. Commun. 329:1076-86(2005)). Consensus sequences for cysteine-knot domains are well known in the state of the art. The cysteine-knot-domain of a protein starts with the first cysteine residue participating in the cysteine-knot of the respective protein and ends with the residue which follows the last cysteine participating in the cysteine-knot of the respective protein. For example, the cysteine-knot domain of the human GDF-5 precursor protein consists of the amino acids 24-125 (see the underlined region of the amino acid sequence of SEQ ID NO:4 encoded by pGDF5-Trc as shown in FIG. 1C).

The term “recombinant” indicates that the material (e.g., a nucleic acid or a polypeptide) has been artificially or synthetically (i.e., non-naturally) altered by human intervention. The alteration can be performed on the material within, or removed from, its natural environment or state. For example, a “recombinant nucleic acid” is one that is made by recombining nucleic acids, e.g., during cloning, DNA shuffling or other well-known molecular biological procedures. A “recombinant DNA molecule” is comprised of segments of DNA joined together by means of such molecular biological techniques. The term “recombinant protein” or “recombinant polypeptide” as used herein refers to a protein molecule which is expressed using a recombinant DNA molecule. A “recombinant host cell” is a cell that contains and/or expresses a recombinant nucleic acid.

A “polynucleotide sequence” or “nucleotide sequence” or “nucleic acid sequence,” as used interchangeably herein, is a polymer of nucleotides, including an oligonucleotide, a DNA, and an RNA, a nucleic acid or a character string representing a nucleotide polymer, depending on context. From any specified polynucleotide sequence, either the given nucleic acid or the complementary polynucleotide sequence can be determined. Included is DNA or RNA of genomic or synthetic origin which may be single- or double-stranded and represent the sense or anti-sense strand.

The term “oligonucleotide” as used herein includes naturally occurring, and modified nucleotides linked together by naturally occurring and non-naturally occurring oligonucleotide linkages. Oligonucleotides are a polynucleotide subset generally comprising a length of 200 bases or fewer. Preferably oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14, 15, 16, 17, 18, 19, or to 40 bases in length. Oligonucleotides are usually single stranded, e.g. for primers and probes; although oligonucleotides may be double stranded, e.g. for use in the construction of a gene mutant. Oligonucleotides of the present disclosure can be either sense or anti-sense oligonucleotides.

As used herein, the terms “nucleic acid molecule encoding,” “DNA sequence encoding” and “DNA encoding” refer to the order or sequence of deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order of ribonucleotides along the mRNA chain, and also determines the order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes for the RNA sequence and for the amino acid sequence.

“Expression of a gene” or “expression of a nucleic acid” means transcription of DNA into RNA (optionally including modification of the RNA, e.g., splicing), translation of RNA into a polypeptide (possibly including subsequent post-translational modification of the polypeptide), or both transcription and translation, as indicated by the context.

As used herein the term “coding region” when used in reference to a structural gene refers to the nucleotide sequences which encode the amino acids found in the nascent polypeptide as a result of translation of an mRNA molecule.

Recombinant DNA-mediated protein expression techniques are applicable to the making of the rhGDF-5 protein. Briefly, a recombinant DNA molecule or construct (pGDF5-T5 or pGDF5-Trc, the polynucleotide sequences of which are set forth in the Sequence Listing as SEQ ID NOS: 2 and 3, respectively), coding for the gene of interest (GDF-5, the polynucleotide sequence as set forth in Sequence Listing as SEQ ID NO:1) is prepared. Methods of preparing such DNA molecules are well known in the art. For instance, sequences encoding the gene of interest (e.g. rhGDF-5 or GDF-5 (SEQ ID NO: 1)) can be excised from DNA using suitable restriction enzymes. Any of a large number of available and well-known host cells may be used in the practice of this present disclosure. The selection of a particular host is dependent upon a number of factors recognized by the art. These include, for example, compatibility with the chosen expression vector, toxicity of the peptides encoded by the DNA molecule, rate of transformation, ease of recovery of the peptides, expression characteristics, biosafety and costs. A balance of these factors must be struck with the understanding that not all hosts may be equally effective for the expression of a particular DNA sequence. Within these general guidelines, useful microbial host cells in culture include bacteria such as Escherichia coli sp. Modifications can be made at the DNA level, as well. For example, the GDF-5 encoding DNA sequence (SEQ ID NO: 1) may be changed to codons more compatible with the chosen host cell. For E. coli, optimized codons are known in the art. Codons can be substituted to eliminate restriction sites or to include silent restriction sites, which may aid in processing of the DNA in the selected host cell. The transformed bacterial host cell line or cell strain is then cultured and purified. Host cells or strains may be cultured under conventional fermentation conditions so that the desired compounds are expressed. Such fermentation conditions are well known in the art.

The region of the vector to which the gene of interest is cloned is referred to herein as an “insertion site.” Preferably, the gene of interest is rhGDF-5 or GDF-5, designated in the Sequence Listing as SEQ ID NO: 1.

In one embodiment, the vector comprises an NdeI restriction site for restriction enzyme analysis purposes.

The term “expression vector” according to the embodiment of the present disclosure refers to a vehicle for introducing a gene of interest into a host cell to express the gene or a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host cell. Nucleic acid sequences necessary for expression in prokaryotes include a promoter, optionally an operator sequence, a ribosome binding site and possibly other sequences. Expression vectors or vectors according to the embodiment of the present disclosure include plasmid vectors.

In one embodiment, the expression vectors of the present disclosure may include regulatory promoters, examples of which may include but are not limited to, T5, T7 and Trc promoters. The regulatory promoters of the present disclosure can be induced by isopropyl-β-D-thiogalactoside (IPTG).

The expression vectors of the present disclosure, which is provided for inducing high expression of a gene of interest (GDF-5 (SEQ ID NO:1)) in the host cells, may preferably further include a resistance gene for host cells, which is used as a selectable marker for permanent expression of the gene in the host cells. Non-limiting examples of such resistance genes for animal cells include those commonly used in the art, such as ampicillin-, neomycin-, kanamycin-, zeomycin- and hygromycin-resistant genes. A resistance gene, according to the embodiment of the present disclosure, is the Kanamycin-resistant (Kan^(r) gene).

According to one embodiment of the present disclosure, the term “host cell” is used to refer to a cell which has been transformed, or is capable of being transformed with a nucleic acid sequence and then of expressing a selected gene of interest (GDF-5, SEQ ID NO:1). Thus, a host cell, as used herein, is also a transformed cell line (or strain) or a transformant.

The term “recombinant host cell” (or simply “host cell”), as used herein, is intended to refer to a cell into which a recombinant expression vector has been introduced. It should be understood that such terms are intended to refer not only to the particular subject cell but to the progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term “host cell” as used herein.

The term “operably linked” refers to a functional linkage between an expression control sequence and a second nucleic acid sequence, wherein the expression control sequence directs transcription of the nucleic acid corresponding to the second sequence.

A host cell “engineered to overexpress” a protein (or a nucleic acid encoding such protein) is a host cell, including a descendant thereof, that has been altered in such a way that higher levels of such protein are expressed than normal, compared to the unaltered host cell. Thus, included within this category are expression of proteins foreign to the host cell, proteins not naturally expressed by the host cell, or proteins naturally expressed by the host cell at relatively low levels that increase after alteration of the host cell

In a preferred aspect, the recombinant protein of interest (rhGDF-5 designated in the Sequence Listing as SEQ ID NO: 4) needs to be expressed in the prokaryotic host cells. Examples of prokaryotic host cell strains include, but are not limited to DH10β, STBL2, HMS174 and recA.

The term “isolated nucleic acid” refers to a nucleic acid of the present disclosure that is free from at least one contaminating nucleic acid with which it is naturally associated. A “nucleic acid” refers to a DNA or RNA sequence, optionally including artificial bases or base analogs.

The term “identity” (or “percent identical”) is a measure of the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (i.e., “algorithms”). The term “similarity” is a related concept but, in contrast to “identity”, includes both identical matches and conservative substitution matches. Identity and similarity of related nucleic acid molecules and polypeptides can be readily calculated by known methods. Preferred methods to determine identity and/or similarity are designed to give the largest match between the sequences tested. Methods to determine identity and similarity are described in publicly available computer programs. Exemplary computer program methods to determine identity and similarity between two sequences include, but are not limited to, the GCG program package, including GAP (Devereux et al., Nucl. Acids. Res. 12: 387, 1984; Genetics Computer Group, University of Wisconsin, Madison, Wis.), BLASTP, BLASTN, and FASTA (Altschul et al., J. Mol. Biol. 215:403-410, 1990)). The BLASTX program is publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul et al. NCB/NLM/NIH Bethesda, Md. 20894; Altschul et al., supra). The well-known Smith-Waterman algorithm may also be used to determine identity. Preferred parameters for a polypeptide sequence comparison include the following: Algorithm: Needleman et al., J. Mol. Biol. 48: 443-53 (1970); Comparison matrix: BLOSUM 62 from Henikoff et al., Proc. Natl. Acad. Sci. USA 89:10915-19 (1992); Gap Penalty: 12, Gap Length Penalty: 4; Threshold of Similarity: 0. The GAP program is useful with the above parameters (along with no penalty for end gaps). Preferred parameters for nucleic acid molecule sequence comparisons include the following: Algorithm: Needleman et al., J. Mol. Biol., 48:443-53 (1970); Comparison matrix: matches=+10, mismatch=0, Gap Penalty: 50, Gap Length Penalty: 3. The GAP program is also useful with the above parameters. Other exemplary algorithms, gap opening penalties, gap extension penalties, comparison matrices, thresholds of similarity, etc. may be used by those of skill in the art.

The phrase “stringent hybridization conditions” refers to conditions under which a probe will hybridize to its target subsequence, typically in a complex mixture of nucleic acid, but to no other sequences. Hybridization stringency is principally determined by temperature, ionic strength, and the concentration of denaturing agents such as formamide. Examples of “highly stringent conditions” for hybridization and washing are 0.015M sodium chloride, 0.0015M sodium citrate at 65-68° C. or 0.015M sodium chloride, 0.0015M sodium citrate, and 50% formamide at 42° C. See Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, (Cold Spring Harbor, N.Y. 1989); and Anderson et al., Nucleic Acid Hybridization: Hybridization: a practical approach, Ch. 4, IRL Press Limited (Oxford, England) (1999). Examples of typical “moderately stringent” conditions are 0.015M sodium chloride, 0.0015M sodium citrate at 50-65° C. or 0.015M sodium chloride, 0.0015M sodium citrate, and 20% formamide at 37-50° C. By way of example, a “moderately stringent” condition of 50° C. in 0.015 M sodium ion will allow about a 21% mismatch.

It will be appreciated by those skilled in the art that there is no absolute distinction between “highly” and “moderately” stringent conditions. For example, at 0.015M sodium ion (no formamide), the melting temperature of perfectly matched long DNA is about 71° C. With a wash at 65° C. (at the same ionic strength), this would allow for approximately a 6% mismatch. To capture more distantly related sequences, one skilled in the art can simply lower the temperature or raise the ionic strength.

The nucleotide and amino acid sequences of pGDF5-T5 and pGDF5-Trc are set forth in SEQ ID NOS: 2 and 3 and SEQ ID NO: 4, respectively.

The term “rhGDF-5 or GDF-5” as used herein refers to human growth and differentiation factor-5 (the polypeptide of SEQ ID NO:4 encoded by the polynucleotide of SEQ. ID NO: 1) thereof, or a biologically active fragment, variant, analog, or derivative of the human GDF-5 protein. Exemplary analogs retain 65% or higher amino acid identity to the parent sequence, or 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or higher identity.

As used herein, the term “rhGDF-5 or GDF-5 (pGDF5-T5 or pGDF5-Trc nucleic acid” or “rhGDF-5 or GDF-5 (pGDF5-T5 or pGDF5-Trc) polynucleotide” refers to a nucleic acid that encodes a polypeptide having an amino acid sequence as set forth in SEQ ID NO:4, including a nucleotide sequence as set forth in SEQ ID NO: 1, or nucleic acids comprising nucleotide sequences that are at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical thereto, or nucleic acids which hybridize under moderately or highly stringent conditions as defined herein with the complement of SEQ ID NO: 1 or any other orthologs of the nucleotide sequence of SEQ ID NO: 1.

The terms “polypeptide” and “protein” are used interchangeably herein.

Overexpression, as described herein, encompasses activating (or causing to be expressed) a gene which is normally silent (unexpressed) in the host cell as obtained, as well as increasing the expression of a gene which is not expressed at physiologically significant levels in the cell as obtained.

The present disclosure includes methods and compositions for the production of rhGDF-5 using the T5 or Trc promoter in the production of rhGDF-5 for therapeutic applications. The rh-GDF-5 can be easily produced in large scale quantities in cost-effective, and time-saving manner.

According to the embodiment of the present disclosure, the term “isolated protein” comprises rhGDF-5 or GDF5 protein. In one embodiment, the protein comprises rhGDF-5 or GDF5 having the amino acid sequence set forth in SEQ ID NO:4 and variants and derivatives of this protein, which retain the activity of the polypeptide of SEQ ID NO: 4. In one embodiment, the protein comprises a polypeptide having at least about 80% identity, at least about 85% identity, at least about 90% identity, at least about 95% identity, at least about 98% identity, or at least about 99% identity to the amino acid sequence set forth in SEQ ID NO: 4.

Stable production of proteins, including biologics, can be accomplished by transfecting host cells with vectors containing DNA that encodes the protein. Maintenance of the vector in the cell line can be achieved through a variety of means

With the evolving importance of therapeutic proteins, i.e., biologics, efforts must be made to optimize protein production, while improving efficiency of the overall production process. Thus, improvements in efficiency must be weighed against the protein production capacity of the vector. There is a need for better expression systems that provide efficient cloning options, as well as high levels of the desired protein product. It would be advantageous to decrease the number of cloning steps involved in the production of biologics to improve time requirements and minimize cost. It would also be advantageous to provide vectors that provide adequate protein production for both small and large scale cell cultures.

The expressed recombinant rhGDF-5 protein, according to the embodiment of the present disclosure, can be collected from pGDF5-T5 or pGDF5-Trc transformed host cell lysates (from strains DH10β, STBL2, HMS174 or RecA). The supernatant (soluble fraction) and pellet (insoluble fraction containing inclusion bodies) can be separated by centrifugation. The pellet may then be collected and disrupted or homogenized to release the inclusion bodies from the bacterial cells. Host cell disruption or homogenization may be performed using well known techniques including, but not limited to, enzymatic cell disruption, sonication, dounce homogenization or high pressure release disruption. In one embodiment, the techniques disclosed are used to disrupt the pGDF5-T5- or pGDF5-Trc-transformed E. coli cells to release the inclusion bodies of rhGDF-5 protein.

After cell disruption, the inclusion bodies may then be subjected to solubilization using suitable denaturing agents known in the art. The denaturing agents may be urea or guanidine hydrochloride. The recombinant rhGDF-5 protein can be recovered and purified from the resulting solution by any of a number well known in the art, including but are not limited to, using ion-exchange chromatography, ammonium sulfate or ethanol precipitation, acid or base extraction, column chromatography, affinity column chromatography, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, hydroxylapatite chromatography, lectin chromatography, gel electrophoresis and the like. Protein refolding steps can be used, as desired, in making correctly folded mature proteins. High performance liquid chromatography (HPLC), affinity chromatography or other suitable methods can be employed in final purification steps where high purity is desired. Once purified, partially or to homogeneity, as desired, the rhGDF-5 proteins are optionally used for a wide variety of utilities, including but not limited to, as assay components, therapeutics (biologics), prophylaxis, diagnostics, research reagents, and/or as immunogens for antibody production.

In addition to other references noted herein, a variety of purification/protein folding methods are well known in the art, including, but not limited to, those set forth in R. Scopes, Protein Purification, Springer-Verlag, N.Y. (1982); Deutscher, Methods in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, Inc. N.Y. (1990); Sandana (1997) Bioseparation of Proteins, Academic Press, Inc.; Bollag et al. (1996) Protein Methods, 2nd Edition Wiley-Liss, NY; Walker (1996) The Protein Protocols Handbook Humana Press, NJ, Harris and Angal (1990) Protein Purification Applications: A Practical Approach IRL Press at Oxford, Oxford, England; Harris and Angal Protein Purification Methods: A Practical Approach IRL Press at Oxford, Oxford, England; Scopes (1993) Protein Purification: Principles and Practice 3rd Edition Springer Verlag, NY; Janson and Ryden (1998) Protein Purification: Principles, High Resolution Methods and Applications, Second Edition Wiley-VCH, NY; and Walker (1998) Protein Protocols on CD-ROM Humana Press, NJ; and the references cited therein.

As used herein, the term “exponential growth” refers to that portion of the cellular growth cycle between the lag phase and the stationary phase when cells are doubling at a logarithmic rate. The term “exponential growth” is also meant to encompass the late lag phase (i.e., the early stationary phase) which occurs between the logarithmic growth phase and stationary phase, when the cell growth rate is slowing, and therefor encompasses an extended exponential growth phase. Therefore, “stationary phase” refers to horizontal growth, i.e., when the cells have essentially stopped dividing and have reached a quiescent stage with respect to cell doubling.

As in conventional fermentation processes, it is usually desirable to obtain as high a rate of cell growth in as high a density of bacterial cell culture as possible, to maximize the amount of bacterial biomass produced per unit of time. “Biomass,” as utilized herein and without alteration from its conventional meaning, refers to the mass and/or accumulating mass of host bacterial cells or transforming bacteria cells resulting from the cultivation of such cells using a variety of techniques, e.g., cultivating such cells in defined or semi-defined media containing additional ingredients that may enhance or increase bacterial growth rate or biomass.

According to the embodiment of the present disclosure, fermentation processes have been developed to maximize the yield of pGDF5-T5 and pGDF5-Trc DNAs from large scale cultures of transformed host cells and to optimize recombinant hGDF-5 protein production. The fermentation processes includes optimizing the plasmid yield such that the supply of metabolites essential for growth is adequate to permit growth to a high biomass, but is not in excess so as to inhibit such growth.

According to one embodiment, host cells transformed with an expression vector that includes a pGDF5-Trc or pGDF5-T5 DNA are cultured in a high cell density medium that are modified based on its response to protein expression and biomass yield. For example, a defined media may additionally include ingredients such as sodium molybdate (ranges from about 5 to about 10 mg/L), magnesium sulfate heptahydrate (ranges from about 2 to about 6 mM), sodium chloride (ranges from 0 to about 4 g/L), EDTA (ranges from 0 to about 400 mg/L), MOPS (ranges from 0 to about 100 mM), amino acid supplement including L-methionine (ranges from 0 to about 10 ml/L) and vitamin supplement (folic acid, pyridoxine, and biotin; ranges from 0 to about 10 ml/L). Alternatively, as semi-defined (complex) media may include, in addition to what was in the defined media, yeast extract and tryptone (animal-derived). Both yeast extract and tryptone range from 0 to about 0.4% w/v. Also included were sodium molybdate, magnesium sulfate, sodium chloride, EDTA, MOPS (3[N-morpholino]propane-sulfonic acid), amino acids (including L-methionine), and vitamins (folic acid, pyrodoxine, and biotin).

A type of fermentation according to the embodiment of the present disclosure is fed-batch fermentation, in which the cell growth rate is controlled by the addition of nutrients to the culture during cell growth. As used herein, “fed-batch fermentation” refers to a cell culture process in which the growth rate is controlled by carefully monitored additions of metabolites to the culture during fermentation. Fed-batch fermentation according to the present disclosure permits the cell culture to reach a higher biomass. The key to fed-batch fermentation is supplying substrate at a rate such that it is completely consumed. As a result, residual substrate concentration is approximately zero and maximum conversion of substrate is obtained. Metabolic overflow from excess substrate is avoided, reducing the formation of inhibitory acetate. Fed-batch fermentation starts with a batch phase. Cells are inoculated into an initial volume of medium that contains all non-limiting nutrients and an initial concentration of the limiting substrate. Controlled feeding of the limiting nutrient begins once the cells have consumed the initial amount of substrate. One of the simplest and most effective feeding strategies is exponential feeding. This method allows the culture to grow at a predetermined rate less than mu (μ)_(max) without the need of feedback control. The fermentation begins with a batch mode containing a non-inhibitory concentration of substrate. The cells grow at mu (μ)_(max) until the substrate is exhausted, at which point the nutrient feeding begins.

The DO-stat and pH-stat methods are fairly easy to implement since most standard fermentor systems include dissolved oxygen and pH monitoring. Trends in dissolved oxygen (DO) and pH can indicate whether substrate is available to the cells. Exhaustion of substrate causes decreased oxygen uptake and the DO concentration in the medium rises. The pH also rises due to consumption of metabolic acids. Feeding is triggered when DO or pH rises above a set threshold. The growth rate can be adjusted by changing the DO or pH threshold value.

Examples

Reference will now be made in detail to certain embodiments of the present disclosure. While the present disclosure will be described in conjunction with the illustrated embodiments, it will be understood that they are not intended to limit the present disclosure to those embodiments. On the contrary, the present disclosure is intended to cover all alternatives, modifications, and equivalents that may be included within the present disclosure as defined by the appended claims. The headings below are not meant to limit the disclosure in any way; embodiments under any one heading may be used in conjunction with embodiments under any other heading.

Nucleotide and amino acid sequences are referred to herein by a sequence identifier number (SEQ ID NOS:1 to 39). A sequence listing is provided at the end of the specification.

GDF5-T5 and GDF5-Trc Plasmid/Expression Vector Construction

Two plasmid expression vectors (pGDF5-T5 and pGDF5-Trc) were engineered for the purpose of expressing a recombinant human growth and differentiation factor-5 (rhGDF-5) of about 13.5 kDa in selected prokaryotic host cell strains. Plasmid maps for both plasmids are provided in FIGS. 1A and 1B. The sizes of pGDF5-T5 and pGDF5-Trc are 4299 and 4403 bp, respectively. The complete nucleotide sequence of the pGDF5-T5 and pGDF5-Trc is designated in the Sequence Listing as SEQ ID NOS: 2 and 3, respectively.

To do this, a starting IP constraint-free plasmid, pJExpress-401 (DNA2.0, Menlo Park, Calif. having either a T5 or a Trc promoter (both IPTG-inducible) was employed. Some of the features of pJExpress-401 include a pUC origin, a Kanamycin selective marker gene (Kan^(R)) gene, either a T5 or a Trc promoter (both IPTG (isopropylthio-β-galactoside)-inducible) and an Nde-1 restriction site. The gene of interest (gene insert) is a codon-optimized human GDF-5 (rhGDF-5) cDNA having a size of 528 bp designated in the Sequence Listing as (SEQ ID NO:1). The reverse complimentary coding sequence of the insert sequence, as underlined is provided in the sequence listing as SEQ ID NO:7.

Plasmid DNA Analysis

As indicated above, the complete DNA sequences for pGDF5-T5 and pGDF5-Trc are provided in the sequence listing as SEQ ID NOS:2 and 3, respectively. The theoretical protein sequence encoded by pGDF5-Trc is denoted herein as SEQ ID. NO. 4. Sequencing was performed on pGDF5-T5 and pGDF5-Trc with a minimum of 2× coverage over the backbone and 4× coverage over the gene of interest (GDF-5 DNA insert). The samples were run on the ABI 3130×1 genetic analyzer and analyzed using ABI's Sequencing Analysis software version 5.3.1. The sequences were edited using Sequencher™ version 5.0. The sequences obtained were assembled into contiguous sequence files. The consensus sequence (MD1.seq corresponding to SEQ ID NO:3) was compared with the corresponding expected reference sequence file (MD1.txt corresponding to SEQ ID NO:6). The sequence file comparisons and additional data are discussed hereinbelow.

Plasmid DNA and Sequencing Primers Preparation and Purification:

Prior sequencing, the concentration of purified plasmid GDF5-Trc and GDF5-T5 DNA (pGDF5-Trc and pGDF5-T5) was determined using the Smartspec™ 3000 spectrophotometer. Sequencing Primers (oligonucleotides) were designed accordingly and the following primers, as well as the polynucleotides and polypeptides according to the embodiments of the present disclosure, are listed on Table 1. The sequencing primers were utilized to sequence pGDF5-Trc.

TABLE 1 SEQ ID NO. Description Sequence Species/Type Length Start Direction Tm % GC 1 GDF-5 DNA See Sequence Listing human/DNA 528 insert - insert DNA from pGDFS-Trc plasmid 2 complete See Sequence Listing human/DNA 4299 sequence of pGDFS-T5 DNA 3 complete See Sequence Listing human/DNA 4403 sequence of pGDFS-Trc DNA 4 theoretical See Sequence Listing human/protein 125 amino acid sequence of rhGDF-5 protein 5 GDFS-CofA See Sequence Listing human/protein 120 from Prospec- TanyTechnoGene Ltd 6 MD1.txt See Sequence Listing human/DNA 4405 reference GDF-5 sequence for Sequencing 7 Reverse See Sequence Listing human/DNA 528 complementary strand insert sequence from pGDFS-Trc plasmid 8 MDP1.1SF1-A CTATCATGCCATACCGCGAAA artificial sequence 21 35 Forward 60 48 9 MDP1.2SF1-A GCCAGCCATTACGCTCGTC artificial sequence 19 382 Forward 60 63 10 MDP1.3SF1-A CGCTACCTTTGCCATGTTTCA artificial sequence 21 721 Forward 60 48 11 MDP1.4SF1-A TAATCGCGGCCTCGACG artificial sequence 17 857 Forward 60 65 12 MDP1.5SF1-A CCTGACCCCATGCCGAA artificial sequence 17 1105 Forward 60 65 13 MDP1.6SF1-A AGTTAGCGACAGCCGCAGC artificial sequence 19 1328 Forward 60 63 14 MDP1.7SF1-A ATGGCTACGCAGCGGAAAC artificial sequence 19 1516 Forward 60 58 15 MDP1.8SF1-A GCGGCATATGTTTTACCTCCTG artificial sequence 22 1686 Forward 59 50 16 MDP1.9SF1-A AGCTCGTAATTGTTATCCGCTCA artificial sequence 23 1812 Forward 59 43 17 MDP1.10SF1-A CAAGCAAAGTGACAGGCGC artificial sequence 19 2214 Forward 59 58 18 MDP1.11SF1-A GGCGGTAATACGGTTATCCACA artificial sequence 22 2542 Forward 60 50 19 MDP1.12SF1-A TGCGCCTTATCCGGTAACTATC artificial sequence 22 2933 Forward 59 50 20 MDP1.13SF1-A TTTTGGTCATGAGTCACTGC artificial sequence 20 3311 Forward 53 45 21 MDP1.14SF1-A GGAACGATGCCCTCATTCAG artificial sequence 20 3691 Forward 59 55 22 MDP1.15SF1-A CCAGCGGATAGTTAATGATCAGC artificial sequence 23 4022 Forward 59 48 23 MDP1.16SF1-A CCGGCATACTCTGCGACATC artificial sequence 20 4366 Forward 60 60 24 MDP1.17SR1-A GATGTCGCAGAGTATGCCGG artificial sequence 20 21 Reverse 60 60 25 MDP1.18SR1-A CATTAACTATCCGCTGGATGACC artificial sequence 23 368 Reverse 59 48 26 MDP1.19SR1-A GCCAACGATCAGATGGCG artificial sequence 18 732 Reverse 60 61 27 MDP1.20SR1-A TGACCAAAATCCCTTAACGTGAGT artificial sequence 24 1087 Reverse 60 42 28 MDP1.21SR1-A GATAGTTACCGGATAAGGCGCA artificial sequence 22 1452 Reverse 59 50 29 MDP1.22SR1-A CCTGCGTTATCCCCTGATTCT artificial sequence 21 1823 Reverse 59 52 30 MDP1.23SR1-A AAACGACGGCCAGTCTTAAGCT artificial sequence 22 2029 Reverse 60 50 31 MDP1.24SR1-A AACGTAAAAACCCGCTTCGG artificial sequence 20 2099 Reverse 60 50 32 MDP1.25SR1-A CGCCTGTCACTTTGCTTGATA artificial sequence 21 2175 Reverse 58 48 33 MDP1.26SR1-A TGAGCGGATAACAATTACGAGCT artificial sequence 23 2572 Reverse 59 43 34 MDP1.27SR1-A TTGCTCCCGTAAAGCCCTG artificial sequence 19 2767 Reverse 60 58 35 MDP1.28SR1-A CCCGATCTCTATTCTGTTCATCG artificial sequence 23 2986 Reverse 59 48 36 MDP1.29SR1-A TACGGCGTTTCACTTCTGAGTTC artificial sequence 23 3265 Reverse 59 48 37 MDP1.30SR1-A GGTGCGACAATCTATCGCTTG artificial sequence 21 3613 Reverse 59 52 38 MDP1.31SR1-A GATCGCGTATTTCGCCTCG artificial sequence 19 3934 Reverse 60 58 39 MDP1.32SR1-A CTGCCTCGGTGAGTTTTCTCC artificial sequence 21 4206 Reverse 60 57

The sequencing primers were diluted to a final concentration of 1.6 pmol/pL and the cycle sequencing reactions were setup in a 96-well plates. The cycle sequencing plates were then loaded onto the ABI Veriti® Thermal Cycler for a cycle sequencing run based on the following cycling conditions (see Table 2):

TABLE 2 Cycling Conditions: 98° C. 5 minutes 30 cycles: 96° C., 30 seconds 50° C., 10 seconds 62° C., 4 minutes Hold:  4° C., 00

Sample purification (dye terminator removal) occurred after the cycle sequencing run. The samples were purified using Qiagen® Dye Ex 2.0 Spin Kit. The samples were eluted with Hi-Di Formamide and transferred to 96-well plates. The 96-well plates were denatured at 95° C. for 2 minutes in the ABI Veriti® Thermal Cycler.

Sequencing Analysis:

The 96-well sequencing plates were run on the ABI 3130×1 Genetic Analyzer using ABI's Data Collection Software version 3.0. The sequencing data (electropherograms) were analyzed using ABI Sequence Analysis software version 5.3.1. The sequences were edited and assembled into contiguous sequence files using Sequencher™ Version 5.0. The following ambiguity codes may appear in the sequence files:

TABLE 3 Symbol: Meaning: 1 Probable C 2 Probable T 3 Probable A 4 Probable G R A or G Y C or T M A or C K G or T W A or T S G or C H A or C or T B G or T or C V G or C or A D G or T or A N A, C, G, or T

DNA Sequencing Analysis Summary:

Of the 4405 bps provided, 4403 bps was sequenced. The plasmid pGDF5-Trc DNA, designated herein as SEQ ID NO:3 was sequenced in full with a minimum of 2× coverage over entire plasmid and 4× coverage over the insert (bp 1324-4851). The insert sequence was conforming and there were 3 discrepancies found outside the coding region (DNA insert): (i) one (1) ambiguity (Y at consensus position bp: 116) and 2 discrepancies (ii) two (2) deletions in supplied sequence (bases that appear in sequence (MD1.txt of SEQ ID NO:6) but not in the sequenced data (MD1.seq of SEQ ID NO:3)) at consensus positions bp: 86 and 87). Protein alignment between the theoretical amino sequence encoded by pGDF5-Trc shows that it is about 99% identical to the amino acid sequence of a commercially-known recombinant human GDF-5 protein (Catalog No. CYT-442; Prospec-TanyTechnoGene Ltd, Rehovot, Israel). See FIG. 1C and Sequence ID NOS:4 and 5, respectively).

pGDF5-T5 and pGDF5-Trc Transformation

Four Escherichia coli bacteria host cell lines were selected for pGDF5-T5 and pGDF5-Trc transformation: (1) DH10β (genotype: F-, mcrA A(mrr-hsdRMS-mcrBC) φ80lacZΔM15 ΔlacX74 recA1 endA1 araD139 Δ (ara, leu)7697 galU galK λ-rpsL nupG/pMON14272/MON7124); (2) STBL2 (genotype: F-, mcrA Δ(mcrBC-hsdRMS-mrr) recA1 endA1lon gyrA96 thi supE44 relA1 Δ(lac-proAB); (3) HMS174 (genotype: F-, recA1 hsdR(rK12-mK12+)(RifR); and (4) recA (genotype: recA1819 complete gene deletion).

A. Preparation of Media and Agar Plates

“Select APS (Alternative Protein Source) LB Media” was prepared by dissolving 20 g of Select APS LB broth base powder in 1 L of purified water with a pH that ranged from about 6.6 to about 7.1. LB agar plates were prepared by adding 7.5 g of Agar, U.S.P. into 500 mL of Select APS LB media. Both APS LB media and agar were autoclaved for 121° C. to 123° C. for ≧45 min on liquid cycle. After cooling to 40-60° C., Kanamycin antibiotic was added to the media and agar plates at a concentration of 50 μg/mL.

B. Host Cell Transformation

Plasmid GDF5-T5 or pGDF5-Trc DNA (SEQ ID NOS:2 and 3, respectively) and a competent cell E. coli host cell, either from strain DH10β, STBL2, HMS174 or recA, were each separately mixed together in a tube and incubated on an ice bath for at least about 30 minutes, heat shocked for 45 seconds at 42° C.±2° C.) and immediately placed on ice for 2-5 min. About 450 μl of SOC media (Bacto tryptone 20 g/L; Bacto yeast extract, 5 g/L; NaCl, 0.5 g/L; MgCl₂.6H₂O 2.03 g/L; glucose 3.6 g/L) was then added into each bacteria-plasmid DNA mixture and placed onto 37° C. (±1° C.) shaker incubator and shaked at about 225 to about 275 rpm for 60 minutes (±5 minutes). To prevent lowering of a dissolved oxygen concentration, the shaker incubator was sped up to keep the dissolved oxygen concentration at 50% of air saturation. After 1 hour, an aliquot of the transformed cells were aseptically plated into APS LB/Kn agar plates and incubated overnight at 37° C. (±1° C.) for about 14 to about 24 hours. The cultivation was proceeded by adding 50% glucose solution at a level of 0.2% to obtain a high cell density, with an indication of abrupt increase of the dissolved oxygen concentration. The final pH of the growth medium was at about a pH of 7.

Table 4 lists the different transformed groups with their corresponding host cell strain and vector constructs:

TABLE 4 Transformed Groups Host Cell Line Vector Construct A DH10β (Clones 1-5) pGDF5-T5 B DH10β (Clones 1-5) pGDF5-Trc C STBL2 (Clones 1-5) pGDF5-T5 D STBL2 (Clones 1-5) pGDF5-Trc E HMS174 (Clones 1-5) pGDF5-T5 F HMS174 (Clones 1-5) pGDF5-Trc G recA (Clones 1-5) pGDF5-T5 H recA (Clones 1-5) pGDF5-Trc

pGDF5-T5 and pGDF5-Trc constructs were each successfully transformed into each four host cell lines. Clonal selection and expression screening were conducted by first performing a small scale fermentation with IPTG induction (1 mM final concentration). For STBL2, clonal selection was performed at 30° C. The presence of pGDF5-T5 or pGDF5-Trc DNA (SEQ ID NOS:2 and 3, respectively) was each confirmed by restriction enzyme analysis with NdeI. GDF5 protein analysis was assessed through SDS-PAGE and Western blotting

To monitor the transformation process, a positive (+) control vector construct, pJExpress, was used. A clone that expresses the pJExpress construct showed an over-expression of a fluorescent protein of about 30 kDa protein after IPTG induction (data not shown).

Expression Screening of pGDF5-T5 and pGDF5-Trc Constructs

A. DNA-Restriction Enzyme Analysis:

Five bacterial colonies (Clones 1-5) from each transformed groups, Groups A-H as listed in Table 4, were initially picked for overnight growth in APS LB media containing 50 μg/ml of Kanamycin (APS LB/Kan). The next day, two (2 ml) of APS LB/Kan bacterial culture from each colony of each transformed groups, were grown at 37° C. (±1° C.) overnight. Plasmid DNAs were extracted and purified using a QIAprep® Spin Miniprep Kit (Qiagen, Inc, Valencia, Calif.). The purified DNAs were linearized with NdeI restriction enzyme and run through a 0.8% agarose gel to verify for the presence of the pGDF5-T5 and pGDF5-Trc constructs. Both linearized pGDF5-T5 and pGDF5-Trc DNAs have an expected size of 4299 bp and 4403 bp, respectively. As shown in FIGS. 2A and 2B, successful transformation of pGDF5-T5 and pGDF5-Trc DNA into DH10β and STBL2 (grown at 30° C.) host cells clones 1-5, respectively, was confirmed. Transformation of pGDF5-T5 and pGDF5-Trc (Clones 1-5, respectively) on HMS174 and recA host cell strains was also successful, the results of which are not presented herein.

B. Preparation of E. coli Inclusion Bodies (IB)

Positive transformants from each Group (A-H) as listed in Table 4 were cultured according to the above-mentioned method. Cells from the culture broth of each transformant were harvested and resuspended in TE buffer (25 mM Tris and 10 mM EDTA, pH 7.3). To collect inclusion bodies that contain the highly purified concentrated rhGDF-5 protein, cells were broken up by means of a homogenizer and spun down to collect the pellet (or precipitate) that contain the inclusion bodies. The inclusion bodies were washed with wash buffer and centrifuged for a period of time at 4° C. The collected pellet was solubilized by sonicating in solubilization buffer. After solubilization, the solution containing the rhGDF-5 protein was centrifuged for a period of time at 4° C.

To obtain high purity with the highest maximum yield yet low oxidation and minimal related impurities, the resultant supernatant was subjected to a weak cation exchanger resin, Toyopearl™ CM-650 from Tosoh Bioscience LLC, King of Prussia, Pa. Briefly, the CM-650 column was first equilibrated with buffer before the resultant supernatant was applied to the CM-650 column. The CM-650 column was washed before eluting with the same buffer modified with a salt to elute the proteins off the column.

C: SDS-PAGE and Western Blotting

To screen for bacterial clones capable of expressing rhGDF-5 protein, five single colonies from each group as listed in Table 1, were further inoculated in 4 ml of APS LB agar plates containing 50 μg/ml of Kanamycin (APS LB/Kan) and grown at 37° C. When the bacterial culture grew to OD₆₀₀ from about 0.4 to about 0.6, they were induced with or without IPTG (1 mM final concentration; ±IPTG). Non-induced and Induced cultures were each harvested and treated with Novagen™ BugBuster Protein Extraction Reagent. Supernatant (soluble proteins) and pellet (insoluble proteins) from each culture, either IPTG-induced or uninduced, were then analyzed by SDS-PAGE.

For SDS PAGE, a total of 20 μL sample (13 μl of sample, 2 μl of reducing agent and 5 μl of sample loading buffer) was prepared for gel loading. The 20 μL-samples were boiled at 95° C. for 5 minutes and loaded either as 5 μl or 20 μl for larger volume loading. All positive clones showed the presence of the expected 13.5 kDa rhGDF-5 protein.

rhGDF-5 expression was observed in the pellets of clones from all of the host cell strains tested, regardless of which vector constructs was used in the transformation. Overexpression of rhGDF-5 protein was particularly observed in clones from HMS174 and RecA strains and was better than those of DH10β and STBL2 ((data not shown). In addition, a higher rhGDF-5 over-expression was observed with longer IPTG induction time (results not included). The size-wise comparison of the over-expression band with the reference protein, rhGDF5 was further confirmed by Western blotting analysis using an anti-human GDF-5 antibody.

From the Western blot results obtained, GDF-5 protein was found in the bacterial pellet fraction (see FIG. 3A) and not in the supernatant fraction (see FIG. 3B). Also, the expression level using the GDF5-TRC construct was better than that of pGDF5-T5 construct. Among the 4 different host cell lines expressing pGDF5-TRC, the HMS174 cell strain provided the optimally-expression levels of the GDF-5 protein.

Characterization of the Over-Expressing GDF-5 Clones

About 400 μl±2% of thawed pGDF5-Trc- or pGDF5-T5-transformed clones were aseptically and inoculated into two separate flasks of pGDF5-T5 and pGDF5-Trc-transformed RecA or HMS174 (Flask 1 for growth monitoring and Flask 2 for storage processing) in the growth medium. Each of the inoculated flasks were shaked at 220 to 250 rpm at a temperature of about 37° C.±1° C. until an optical density (OD₆₀₀) of about 1 to about 3 was reached. After 8 hours of fermentation (EFT8; Elapsed Fermentation Time), hourly sampling from Flask 1 was taken for optical density measurements. Sampling may occur prior to EFT 8 and when OD₆₀₀ of Flask 1 reached between 0.8 and 1.0, the OD₆₀₀ of Flask 2 was also measured.

Of the forty clones (5 clones per each transformed group) examined, two clones, Clones #1 and 4 derived from pGDF5-TRC-transformed HMS174, were selected and evaluated for further studies, e.g., ability to express and produce GDF-5 and growth profile and rate (see FIGS. 4A and 4B, respectively).

The growth profile study was done to understand how these cells grow in the lag, exponential and stationary phases. The data obtained may then allow for profiling growth rates, population doubling time (expressed as p), carbon consumption and waste production.

Growth rate constant, according to the embodiment of the present disclosure, can be defined as the number of generations that occur per unit time (expressed as μ or mu), where (1) μ=(ln N2−ln N1)/(t2−t1) where N2 and N1=cells ml-1 at time t2 and t1 (in h); and (2) convert (1) to log: μ=(log N2−log N1) (2.303)/t2 and t1).

As depicted in FIG. 4A, pGDF5-Trc-transformed HMS174 Clones 1 and 4 both overexpress the rhGDF-5 protein. However, the growth profile of Clone 4 was better than that of Clone 1 (with Clone 4 having a higher mu or μ value for growth rate), despite the similarity in their OD₆₀₀ values starting from EFT0 to EFT18.5-19.0 (see FIG. 4B). Growth characterization and protein production were further conducted by growing the transformants using two fermentation methods: (1) ultra yield shake flask (UY SF) and (2) 5 L Applikon Fermentor (Ferm).

The ultra yield shake flask (UY SF) fermentation method, as discussed above, may be used to easily make the recombinant protein material without the use of a fermentor. This technology is disposable, and is product dedicated. It stimulates a fermentation environment but does not require the infrastructure and laborious setup that actual bioreactor fermentations need. It has a similar set up like a disposable shake flask and allows for the manufacturing of the recombinant protein material that is closely representative of what can be made in a fermentor to support downstream and analytical development of the recombinant protein

The 5 L fermentation method, on the other hand, is a non-GMP batch induction fermentation method for production of the recombinant GDF-5 protein. It involves the inoculation of 400 mL of seed media with 1000 μl seed ampoule of pGDF5-Trc-transformed Clone 1 or 4 and shaking the media at 250 rpm, for 8 hrs at 37° C. This was followed by inoculating about 200 mL of seed culture into the 5-L fermenter with the following fermentation parameters:

Fermentation Parameters: Temperature 37° C., Stirring: 330-1322 rpm, Airflow: 4 L/min, pH control: 6.8+/−0.2 with ammonium hydroxide and 50% phosphoric acid, dissolved oxygen: 30%, and anti-foam 204 as needed.

When the OD₆₀₀ reaches at about 0.6-0.8, 1 mM IPTG was added for induction. After 14-18 hours of post-induction, the bacterial culture was harvested.

The goal for exploring these methods is to develop a recombinant protein production that is not laborious and simple for analytical and purification approach. These two methods vary in terms of the following as illustrated in Table 5 and in the results obtained as shown in FIG. 5A and FIG. 5B.

TABLE 5 UY Shake Flask 5L Applicon Fermentor (FIG. 6A) (FIG. 6A) pH Control APS Super Broth with APS Super Broth with 100 mM MOPs H3PO4 and NH4OH pH Shift pH shift to more acidic may pH maintained at cause protein induction 7.0 ± 0.2 Growth lower growth rate growth rate was above 1 Rate at induction at induction

A better GDF5 protein expression was observed when UY SF method was employed (see SDS-PAGE as shown in FIG. 6A), but the overall GDF5 expression may appear equivalent between Clones 1 and 4 (see western blot as shown in FIG. 6B, lanes 6 and 10). In addition, the pellet fraction from Clone 1 UY SF (sample PF030311A, lane 10) appeared to have a more dense GDF5-band (see FIG. 6B). Supernatant (soluble proteins) and pellet (insoluble proteins) samples from pGDF5-Trc-transformed Clones 1 and 4 were analyzed using SDS-PAGE and have the following designations:

TABLE 6 Sample Designated Sample Source Description pGDF5-Trc Clone # Supernatant SF020311A Clone 1 UY SF Pellet PF020311A Clone 1 UY SF Supernatant SF020311B Clone 4 UY SF Pellet PF020311B Clone 4 UY SF Supernatant SF020311C Clone 1 5L Ferm Pellet PF020311C Clone 1 5L Ferm Supernatant SF020311D Clone 4 5L Ferm Pellet PF020311D Clone 4 5L Ferm Maximizing pGDF5-Trc DNA Yield and Optimal Recombinant GDF-5 Protein Production

High plasmid DNA yield and recombinant protein production, while still cost-effective, are important for the manufacturing of rhGDF-5 biologics. Therefore, to attain the maximum pGDF5-Trc DNA yield and rhGDF-5 protein production, efforts were made to obtain a high cell density fermentation for DNA production. Specifically, the type of production (batch, fed-batch or continuous fermentation), type of media and components and growth control strategies were considered. To achieve these goals, the inventors added several components or ingredients to a high cell density media (formulation as described in FIG. 7) to determine their effects of cell growth, GDF5-Trc DNA plasmid yield and rhGDF-5 protein expression without compromising GDF5-Trc plasmid quality and GDF-5 protein expression. A two level fractional factorial designed of experiment (DoE) was performed in Thomson 24-microwell plates using either a defined (minimal) or semi-defined (complex) media. Statistical software was employed to evaluate these various formulations for fermentation media.

A. Defined Media DoE (7-Factor DoE Design Space)

The defined media additionally includes ingredients such as sodium molybdate (ranges from about 5 to about 10 mg/L), magnesium sulfate heptahydrate (ranges from about 2 to about 6 mM), sodium chloride (ranges from 0 to about 4 g/L), EDTA (ranges from 0 to about 400 mg/L), MOPS (ranges from 0 to about 100 mM), amino acid supplement including L-methionine (ranges from 0 to about 10 ml/L) and vitamin supplement (folic acid, pyrodoxine, and biotin; ranges from 0 to about 10 ml/L). Center points were added to detect for curvature and residual testing and lack-of-fit testing were included in the studies. The transformed bacteria were grown in Thomson 24-well microplate at 37° C., 250 rpm, induced after 4 hours of elapsed fermentation time (EFT4) with 1 mM IPTG and harvested after incubating for an additional 14 hours. Chemical lysis was performed on harvested samples. Bacterial pellet samples were run on SDS-PAGE. Gels were analyzed using ImageJ densitometry software (see FIGS. 8A-B).

B. Semi-Defined Media DoE

The semi-defined (complex) media included, in addition to what was in the defined media, yeast extract and tryptone (animal-derived). Both yeast extract and tryptone range from 0 to about 0.4% w/v. Also included were sodium molybdate, magnesium sulfate, sodium chloride, EDTA, MOPS (3[N-morpholino]propane-sulfonic acid), amino acids (including L-methionine), and vitamins (folic acid, pyrodoxine, and biotin). The center points were added to detect for curvature and residual testing were included. The transformed bacteria were grown in Thomson 24-well microtitreplate in high-throughput minibioreactor system at 37° C., 1000 rpm. During this time, biomass, pH and pO2 were measured at 15 minute interval. The bacteria culture was induced after 4 hours of elapsed fermentation time (EFT4) with 1 mM IPTG and harvested after incubating for an additional 14 hours. Chemical lysis was performed on harvest samples and bacterial pellet samples were run on SDS-PAGE and analyzed using ImageJ densitometry software (see FIGS. 9A-D).

Based on the results obtained from the two-level fractional factorial design, three other high cell density (HCD) growth media were tested and further developed. They differ from each other with respect to their optimized response to either rhGDF5 expression (protein production) or biomass yield (growth rate) as follows: Media 1 (defined media improved rhGDF-5 expression) included sodium molybdate, magnesium sulfate and sodium chloride. Expression of rhGDF-5 was increased by increasing sodium molybdate, magnesium sulfate and sodium chloride in defined media (see Tables 6-7 and FIGS. 8A-B). Media 2 (semi-defined media improved rhGDF-5 expression) included magnesium sulfate and yeast extract. Expression of rhGDF-5 was increased in the presence of yeast extract and magnesium sulfate but decreased when sodium molybdate was present (see Tables 7-8 and FIGS. 9A-B). Media 3 (semi-defined media improved biomass yield) included yeast extract and tryptone while being negatively affected by sodium chloride and MOPS (see Tables 7-8 and FIGS. 9C-D).

As demonstrated in Table 7 and 8, Media 1 and 2 both enhanced the expression of rhGDF-5 while Media 3 improved or optimized biomass yield.

TABLE 7 Optimized Harvest rhGDF-5 Band Amount Media Media Type Response OD₆₀₀ (μg/mL) 1 Defined rhGDF-5 7.58 0.133 expression 2 Semi-Defined rhGDF-5 8.3 1.03 expression 3 Semi-Defined Biomass 9.06 0.040 4 Super Broth NA 15.1 0.105 (Complex)

It was also noted that scale-up to 1 L volumes precipitated Media 2 while reduction in phosphates allowed the expression of rhGDF-5 and prevented precipitation of magnesium phosphate. Furthermore, substituting tryptone with peptone, a non-animal derived alternative, increased rhGDF-5 expression. Data not included herein.

Similar findings were obtained with a smaller scale fermentation (24-well plate) and the 2.5 L Ultra Yield Flask scale up fermentation methods with respect to the effect of Media 1 and 3 on GDF5 on biomass yield (see Table 8). Their effect on GDF-5 protein expression was not the same. Enhanced GDF5 protein expression was demonstrated when cultures were grown on the 24-well plate with Media 2 but not on Media 1 and Media 3. See data from Table 8. A problem was encountered with Media 2 during the 2.L-Ultra Yield Flask scale-up fermentation run. The solution precipitated during overnight storage which may have been due to magnesium sulfate reacting with phosphate-buffering system to form insoluble magnesium phosphates.

TABLE 8 Media Alternatives 2.5 L UY Confirm. Run 24 Well Plates rhGDF-5 rhGDF-5 rhGDF5 rhGDF-5 rhGDF 5 rhGDF-5 band amount per band amount per band amount per Media Optimized Harvest amount ml ferm Harvest amount ml ferm Harvest amount ml ferm Media Type Response ID OD600 (μg) (μg/ml) OD600 (μg) (μg/ml) OD600 (μg) (μg/ml) 1 Defined rhGDF-5 N/A 8.64 0.6678 14.99 7.25 2.596 24.15 7.58 0.1402 1.023 expression 2 Semi rhGDF-5 2a 9.82 0.1913 6.278 7.79 0.4379 4.378 8.3 0.9934 7.936 defined expression 2b 9.82 0.7756 16.37 3 Semi Biomass 3 11.96 0.1406 10.11 12.2 3.343 52.34 9.06 0.0351 0.3060 defined 3a 10.12 0 0 3b 10.42 0 0 3c 11.42 1.172 17.17 5B Super N/A N/A 22.72 1.609 46.92 19.06 2.181 53.34 15.1 0.1668 2.425 Broth (Complex)

Effects of pH and Oxygen on the GDF-5 Protein Production and Expression

To study the effects of pH and oxygen on GDF-5 protein production and expression, pGDF5-Trc-transformed host cells (HMS174 strain, Clone F031512 were grown under the following growth conditions: (i) pH 6.5 (condition A); ii) pH 7.1 (condition B); and iii) pH 6.8 at low oxygen (condition C) and induced with IPTG. Samples designated hereinafter as F031512A, F031512AB, and F031512A C from each of the three conditions were normalized to 4.0 at OD₆₀₀ by dilution into BugBuster® Plus Lysonase™ (Novagen®) before separately running them under reducing and non-reducing SDS-PAGE conditions. Each of the sample pellets (EFT22, EFT24, EFT26 and EFT28 were resuspended in equal volume of water and added to 4× reducing sample buffer. All 3 SDS-PAGE gels (see FIGS. 11A-C), i.e., Samples F031512A, B, and C loaded on Gels A, B, and C, respectively, were stained and de-stained in the same gel tray. Intensity and banding pattern of the reference standards varied greatly from gel to gel. Densitometry data was generated comparatively. All calculations were based on standards value from the gel that produced a single band standard (Gel C). Though the overall yield estimations accuracy maybe questionable but their relative comparisons should be valid since the molecular weight ladder showed consistent band intensity at molecular weights similar to the protein of interest.

As shown in FIG. 11E, the F031512A samples resulted in a higher ratio of the two major expressed protein bands (40 kDa:14 kDa) than Samples F031512B and F031512C. Low oxygen (condition C) resulted in the lowest ratio of 40 kDa:14 kDa (FIG. 11E). Among the three conditions tested, cells that grew under growth media of pH 7.1 (condition B) had the highest overall GDF5 protein expression (FIG. 11B) and growth rate (see FIG. 11D) than cells that grew under condition A (growth media of pH 6.5; see FIG. 11A) and condition C (low oxygen condition; see FIG. 11C). Extreme reduction in dissolved oxygen leading to anaerobic conditions may not improve protein expression but can influence the ratio of 40 kDa:14 kDa (see FIGS. 11C and 11E).

It will be apparent to those skilled in the art that various modifications and variations can be made to various embodiments described herein without departing from the spirit or scope of the teachings herein. Thus, it is intended that various embodiments cover other modifications and variations of various embodiments within the scope of the present teachings. 

1. An expression vector comprising: a T5 promoter operably linked to a polynucleotide sequence that encodes a GDF-5 protein.
 2. An expression vector of claim 1, further comprising a kanamycin resistance (Kan′) gene and a pUC origin of replication.
 3. An expression vector of claim 1, wherein the T5 promoter is inducible by isopropylthio-β-galactoside (IPTG).
 4. An expression vector of claim 1, wherein the expression vector further comprises a nucleic acid encoding a recombinant human GDF-5 protein having the amino acid sequence of SEQ ID NO:
 4. 5. An expression vector of claim 1, wherein the polynucleotide sequence is SEQ ID NO: 1 that encodes the polypeptide sequence of SEQ ID NO: 4 and the polypeptide sequence is at least 95% identical to SEQ ID NO:
 4. 6. An expression vector of claim 1, wherein the vector is a plasmid vector that is pGDF5-T5.
 7. An expression vector of claim 6, wherein the pGDF5-T5 comprises the polynucleotide sequence of SEQ ID NO:
 2. 8. An expression vector of claim 1, wherein the polynucleotide sequence comprises a codon-optimized human rhGDF-cDNA of SEQ ID NO: 1 having a size of 528 bp.
 9. An expression vector of claim 1, wherein the GDF-5 protein is an rhGDF-5 protein.
 10. A host cell line engineered to express rhGDF-5 protein by the expression vector of claim
 1. 11. A host cell line of claim 10, wherein the host cell line is an Escherichia coli cell comprising the strain of DH10β, STBL2, HMS174 and RecA.
 12. A host cell line of claim 10, wherein the host cell line is engineered to express rhGDF-5 protein comprising at least 95% identity to the amino acid sequence of SEQ. ID NO:
 4. 13. A method for producing an rhGDF-5 (rhGDF-5) protein, the method comprising: providing a prokaryotic host cell comprising an expression vector which comprises a polynucleotide encoding an rhGDF-5 protein under the control of a T5 or Trc promoter; and cultivating the prokaryotic host cell under suitable conditions so as to induce or promote the expression of the polynucleotide in the expression vector and recovering the rhGDF-5 protein.
 14. A method of claim 13, wherein the prokaryotic host cell is an Escherichia coli cell from the strain of DH10β, STBL2, HMS174 and RecA and the expression vector further comprises a polynucleotide encoding an rhGDF-5 protein under the control of an inducible promoter.
 15. A method of claim 13, wherein the polynucleotide is SEQ ID NO: 1 and the protein sequence is SEQ ID NO:
 4. 16. A method of claim 13, wherein the expression vector comprises a promoter that is inducible by isopropylthio-β-galactoside (IPTG).
 17. A method of claim 13, wherein the expression vector further comprises a nucleic acid encoding a recombinant human GDF-5 protein having the amino acid sequence of SEQ. ID NO: 4 and the amino acid sequence is at least 95% identical to SEQ ID NO:
 4. 18. A method of claim 13, wherein the expression vector is a plasmid vector that is pGDF5-T5.
 19. A method of claim 18, wherein pGDF5-T5 comprises a polynucleotide sequence of SEQ ID NO:
 2. 20. A method of claim 13, wherein the polynucleotide sequence comprises a codon-optimized human rhGDF-cDNA of SEQ ID NO: 1 having a size of 528 bp and the polypeptide sequence is an rhGDF-5 protein. 